B2SG: a TOEFL-like Task for Portuguese
نویسندگان
چکیده
Resources such as WordNet are useful for NLP applications, but their manual construction consumes time and personnel, and frequently results in low coverage. One alternative is the automatic construction of large resources from corpora like distributional thesauri, containing semantically associated words. However, as they may contain noise, there is a strong need for automatic ways of evaluating the quality of the resulting resource. This paper introduces a gold standard that can aid in this task. The BabelNet-Based Semantic Gold Standard (BSG) was automatically constructed based on BabelNet and partly evaluated by human judges. It consists of sets of tests that present one target word, one related word and three unrelated words. BSG contains 2,875 validated relations: 800 for verbs and 2,075 for nouns; these relations are divided among synonymy, antonymy and hypernymy. They can be used as the basis for evaluating the accuracy of the similarity relations on distributional thesauri by comparing the proximity of the target word with the related and unrelated options and observing if the related word has the highest similarity value among them. As a case study two distributional thesauri were also developed: one using surface forms from a large (1.5 billion word) corpus and the other using lemmatized forms from a smaller (409 million word) corpus. Both distributional thesauri were then evaluated against BSG, and the one using lemmatized forms performed slightly better.
منابع مشابه
Authenticity Evaluation of TOEFL iBT Speaking Module from the Perspective of Applied Linguistics and General Education
For the first time, this study combined models and principles of authentic assessment from two parallel fields of applied linguistics as well as general education to investigate the authenticity of the TOEFL iBT speaking module. The study consisted of two major parts, namely task analysis and task survey. Utilizing Bachman and Palmer’s (1996) definition of authenticity, the task analysis examin...
متن کاملEFL Learner’s Evaluation of Writing Tasks in Iran’s TOEFL and IELTS Preparation Courses in Light of the Process-oriented Approach
The purpose of this research was to analyze EFL writing tasks in two of the most popular English for Speakers of Other Languages (ESOL) exam preparation courses in Iran, namely IELTS and TOEFL. Having collected the criteria of writing task appropriateness in light of the process-oriented approach to writing instruction, we asked 60 learner participants to rate EFL writing tasks in 3 IELTS and 3...
متن کاملError Taxonomy of TOEFL iBT Writing: An Iranian Perspective
TOEFL iBT has turned recently heads to the impacts language tests can have on language learning. Since error analysis-based instruction has gained a new life with the advent of the computer analysis of the learner’s language, the researchers of this study embarked on examining a sample of integrated and independent writing tasks of 45 Iranian TOEFL iBT candidates in order to identify and classi...
متن کاملTask Type and Prompt Effect on Test Performance: A Focus on IELTS Academic Writing Tasks
Recent versions of international high-stakes tests like TOEFL and IELTS have made use of integrated tasks in addition to the traditional independent tasks in a claim to provide a more realistic estimation of the test takers’ language abilities. The present study aimed to investigate how test takers’ performance may differ on such tasks. As such, the test takers’ performance was compared on IELT...
متن کاملTOEFL iBT integrated speaking tests: a comparison of test-takers' performance in terms of complexity, accuracy, and fluency
This study compares three integrated tasks of the TOEFL iBT speaking subtest in terms of complexity, accuracy, and fluency. To this end, a group of TOEFL iBT Iranian candidates took a simulated TOEFL iBT some days prior to their real exam. The collected oral responses were first transcribed and then quantified using software such as ‘Syllable Counter’ and ‘Coh-Metrix3’ for fluency and complexit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016